Zeta: A Global Method for Discretization of Continuous Variables
Authors
Abstract
This paper introduces a new technique for discretization of continuous variables based on zeta, a measure of strength of association between nominal variables developed for this purpose. Zeta is defined as the maximal accuracy achievable if each value of an independent variable must predict a different value of a dependent variable. We describe both how a continuous variable may be dichotomised by searching for a maximum value of zeta, and how a heuristic extension of this method can partition a continuous variable into more than two categories. Experimental comparisons with other published methods show that zeta-discretization runs considerably faster than other techniques without any loss of accuracy.
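As an illustration of the definition above, the following is a minimal Python sketch, not the authors' implementation. It assumes zeta can be computed by brute force over every one-to-one assignment of independent-variable values to class values, and that dichotomisation tries the midpoint between each pair of adjacent distinct values as a candidate cut. The function names `zeta` and `best_cut` are hypothetical, and the sketch assumes there are at least as many class values as independent-variable values.

```python
from itertools import permutations


def zeta(pairs, x_values, y_values):
    """Best accuracy when each x value must predict a distinct y value.

    Assumption: brute force over one-to-one assignments; needs
    len(y_values) >= len(x_values), otherwise the result is 0.0.
    """
    counts = {}
    for x, y in pairs:
        counts[(x, y)] = counts.get((x, y), 0) + 1
    best = 0
    for assigned in permutations(y_values, len(x_values)):
        # Cases predicted correctly if x_values[i] predicts assigned[i].
        correct = sum(counts.get((x, y), 0) for x, y in zip(x_values, assigned))
        best = max(best, correct)
    return best / len(pairs)


def best_cut(values, labels):
    """Return (cut, zeta) for the two-way split that maximises zeta.

    Assumption: candidate cuts are midpoints between adjacent distinct values.
    """
    classes = sorted(set(labels))
    distinct = sorted(set(values))
    best = (None, -1.0)
    for lo, hi in zip(distinct, distinct[1:]):
        cut = (lo + hi) / 2
        # The dichotomised variable is True above the cut, False otherwise.
        pairs = [(v > cut, c) for v, c in zip(values, labels)]
        score = zeta(pairs, [False, True], classes)
        if score > best[1]:
            best = (cut, score)
    return best


values = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
labels = ["a", "a", "a", "b", "b", "a"]
cut, score = best_cut(values, labels)
print(cut, score)
```

On this toy data the chosen split puts the first three values in one interval and the rest in the other, so that only the last case is misclassified. The brute-force search over assignments grows factorially with the number of categories and is meant only to make the definition concrete.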
Similar resources
An Efficient Global Discretization Method Technical Report Number 296
The development of an effective and efficient method for discretization of continuous variables is an important problem to be solved in developing generally applicable methods for data mining. In Ho and Scott 1997, we describe a new technique for discretization of continuous variables based on zeta, a measure of strength of association between nominal variables. The old zeta method partitions a...
Zeta: A Global Method for Discretization of Continuous Variables
Discretization of continuous variables so they may be used in conjunction with machine learning or statistical techniques that require nominal data is an important problem to be solved in developing generally applicable methods for data mining. This paper introduces a new technique for discretization of such variables based on zeta, a measure of strength of association between nominal variables...
A FUZZY MINIMUM RISK MODEL FOR THE RAILWAY TRANSPORTATION PLANNING PROBLEM
Railway transportation planning under a fuzzy environment is investigated in this paper. As a main result, a new modeling method, called the minimum risk chance-constrained model, is presented based on the credibility measure. For the convenience of solving the mathematical model, the crisp equivalents of chance functions are analyzed under the condition that the involved fuzzy parameter...
A global optimal algorithm for class-dependent discretization of continuous data
This paper presents a new method to convert continuous variables into discrete variables for inductive machine learning. The method can be applied to pattern classification problems in machine learning and data mining. The discretization process is formulated as an optimization problem. We first use the normalized mutual information that measures the interdependence between the class labels and...
Local and Global Approaches to Fracture Mechanics Using Isogeometric Analysis Method
The present research investigates the implementations of different computational geometry technologies in isogeometric analysis framework for computational fracture mechanics. NURBS and T-splines are two different computational geometry technologies which are studied in this work. Among the features of B-spline basis functions, the possibility of enhancing a B-spline basis with discontinuities ...
Publication date: 1999